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TITLE OF THE INVENTION 

G PROTEIN-COUPLED RECEPTOR RESEMBLING THE THROMBIN 
RECEPTOR 

5 CROSS-REFERENCE TO RELATED APPLICATIONS 
Not applicable. 

STATEMENT REGARDING FEDERALLY-SPONSORED R&D 
Not applicable. 

10 

REFERENCE TO MICROFICHE APPENDIX 
Not applicable. 

FIELD OF THE INVENTION 
15 This invention relates to a novel human cDNA encoding HG52, a G 

protein-coupled receptor (GPCR) having homology to protease-activated receptors 
such as the thrombin receptors, the protein encoded by the cDNA, and methods of 
identifying selective agonists and antagonists of the protein encoded by the cDNA. 

20 BACKGROUND OF THE INVENTION 

G-protein coupled receptors (GPCRs) are a very large class of 
membrane receptors that relay information from the exterior to the interior of cells. 
GPCRs function by interacting with a class of heterotrimeric proteins known as G- 
proteins. Most GPCRs function by a similar mechanism. Upon the binding of 

25 agonist, a GPCR catalyzes the dissociation of guanosine diphosphate (GDP) from the 
a subunit of G proteins. This allows for the binding of guanosine triphosphate (GTP) 
to the a subunit, resulting in the disassociation of the a subunit from the P and y 
subunits. The freed a subunit then interacts with other cellular components, and in 
the process passes on the extracellular signal represented by the presence of the 

30 agonist. Occasionally, it is the freed P and y subunits which transduce the agonist 
signal. 

GPCRs possess common structural characteristics. They have seven 
hydrophobic domains, about 20-30 amino acids long, linked by sequences of 
hydrophilic amino acids of varied length. These seven hydrophobic domains 
35 intercalate into the plasma membrane, giving rise to a protein with seven 
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transmembrane domains, an extracellular amino terminus, and an intracellular 
carboxy terminus (Strader et al., 1994, Ann. Rev. Biochem. 63:101-132; Schertler et 
al., 1993, Nature 362:770-7721; Dohlman et al., 1991, Ann. Rev. Biochem. 60:653- 
688). 

5 GPCRs are expressed in a wide variety of tissue types and respond to a 

wide range of ligands, e.g., protein hormones, biogenic amines, peptides, lipid derived 
messengers, etc. Given their wide range of expression and ligands, it is not surprising 
that GPCRs are involved in many pathological states. This has led to great interest in 
developing modulators of GPCR activity that can be used pharmacologically. For 
10 example, Table 1 of Stadel et al., 1997, Trends Pharmacol. Sci. 18:430-437, lists 37 
different marketed drugs that act upon GPCRs. Accordingly, there is a great need to 
understand GPCR function and to develop agents that can be used to modulate GPCR 
activity. 

The thrombin receptors are GPCRS that are members of the seven 

15 transmembrane domain receptor family. The first discovered thrombin receptor has a 
relatively long amino-terminal extracellular domain. This extracellular domain serves 
as a substrate for thrombin. Thrombin cleaves the extracellular domain at a site 41 
amino acids from its N terminus, creating a new N terminus. This new N terminus 
activates the thrombin receptor by functioning as a tethered, intramolecular ligand 

20 (Vu et al., 1991, Cell 64:1057-1068). The thrombin receptors may be responsible for 
mediating the effects of thrombin in platelets, endothelial cells, smooth muscle cells, 
fibroblasts, mesangial cells, and neural cells. For a review of the thrombin receptors, 
see Dennington & Berndt, 1994, Clin. Exper. Pharmacol. Physiol. 21:349-358. 

The medical importance of the thrombin receptors is underscored by a 

25 consideration of the variety of effects attributable to thrombin: conversion of 

fibrinogen to fibrin in plasma; activation of clotting factors V, VIII, XIII, and protein 
C; modulation of the procoagulant function of platelets and endothelial cells; and the 
stimulation of platelet activation (Davey & Luscher, 1967, Nature 216:857-858; 
Coughlin et al., 1992, J. Clin. Invest. 89:351-355); chemotaxis of monocytes (Bar- 

30 Shavit et al., 1983, Science 220:728-731) and lymphocytes (Bizios et al, 1985, 

Thrombosis Res. 38:425-431); mitogenesis of lymphocytes and mesenchymal cells 
such as vascular smooth muscle cells, fibroblasts, and epithelial cells (Chen & 
Buchanan, 1975, Proc. Natl Acad. Sci. USA 72:131-135; Reddan et al., 1982, Invest. 
Ophthal. Visual Sci. 22:486-493). Modulation of the activity of the thrombin 

35 receptors is one possibility for modulating the above-described effects of thrombin. 
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U.S. Patent No. 5,686,597 discloses a homologue of the human 
thrombin receptors. 

SUMMARY OF THE INVENTION 
5 The present invention is directed to a novel human cDNA that encodes 

a G-protein coupled receptor, HG52. The DNA encoding HG52 is substantially free 
from other nucleic acids and has the nucleotide sequence shown in SEQ.ID.NO.: 1. 
Also provided is an HG52 protein encoded by the novel cDNA sequence. The HG52 
protein is substantially free from other proteins and has the amino acid sequence 
10 shown in SEQ.ID.NO.:2. Methods of expressing HG52 in recombinant systems and 
of identifying agonists and antagonists of HG52 are provided. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 shows the complete cDNA sequence of HG52 
15 (SEQ.ID.NO.:l). 

Figure 2 show the complete amino acid sequence of HG52 
(SEQ.ID.NO.:2). 

Figure 3 A-B shows the translation of the HG52 open reading frame. 
The nucleotide sequence shown is (SEQ.ID.NO.:!). The amino acid sequence shown 
20 is (SEQ.ID.NO.:2). 

Figure 4 shows the results of a Northern blot of the expression of 
HG52 mRNA in various human tissues. 

Figure 5 shows an alignment of the amino acid sequence of HG52 with 
the amino acid sequence of the human thrombin receptor 1 (SEQ.ID.NO. :3). 

25 

DETAILED DESCRIPTION OF THE INVENTION 
For the purposes of this invention: 

"Substantially free from other proteins" means at least 90%, preferably 
95%, more preferably 99%, and even more preferably 99.9%, free of other proteins. 

30 Thus, an HG52 protein preparation that is substantially free from other proteins will 
contain, as a percent of its total protein, no more than 10%, preferably no more than 
5%, more preferably no more than 1%, and even more preferably no more than 0.1%, 
of non-HG52 proteins. Whether a given HG52 protein preparation is substantially 
free from other proteins can be determined by such conventional techniques of 

35 assessing protein purity as, e.g., sodium dodecyl sulfate polyacrylamide gel 
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electrophoresis (SDS-PAGE) combined with appropriate detection methods, e.g., 
silver staining or immunoblotting. 

"Substantially free from other nucleic acids" means at least 90%, 
preferably 95%, more preferably 99%, and even more preferably 99.9%, free of other 
5 nucleic acids. Thus, an HG52 DNA preparation that is substantially free from other 
nucleic acids will contain, as a percent of its total nucleic acid, no more than 10%, 
preferably no more than 5%, more preferably no more than 1%, and even more 
preferably no more than 0.1%, of non-HG52 nucleic acids. Whether a given HG52 
DNA preparation is substantially free from other nucleic acids can be determined by 

10 such conventional techniques of assessing nucleic acid purity as, e.g., agarose gel 

electrophoresis combined with appropriate staining methods, e.g., ethidium bromide 
staining, or by sequencing. 

"Functional equivalent" means a receptor which does not have exactly 
the same amino acid sequence as naturally occurring HG52, due to alternative 

15 splicing, deletions, mutations, substitutions, or additions, but retains substantially the 
same biological activity as HG52. Such functional equivalents will have significant 
amino acid sequence identity with naturally occurring HG52. Genes and cDNA 
encoding such functional equivalents can be detected by reduced stringency 
hybridization with a DNA sequence encoding naturally occurring HG52. For the 

20 purposes of this invention, naturally occurring HG52 has the amino acid sequence 
shown as SEQ.ID.NO.:2 and is encoded by SEQ.ID.NO.:l. A nucleic acid encoding 
a functional equivalent has at least about 50% identity at the nucleotide level to 
SEQ.ID.NO.:l. 

A polypeptide has "substantially the same biological activity" as HG52 
25 if that polypeptide has a K<i for a ligand that is no more than 5-fold greater than the 
Kd of HG52 having SEQ.ID.NO.:2 for the same ligand. 

A "conservative amino acid substitution" refers to the replacement of 
one amino acid residue by another, chemically similar, amino acid residue. Examples 
of such conservative substitutions are: substitution of one hydrophobic residue 
30 (isoleucine, leucine, valine, or methionine) for another; substitution of one polar 
residue for another polar residue of the same charge (e.g., arginine for lysine; 
glutamic acid for aspartic acid). 

By "isolated HG52 protein" is meant HG52 protein that has been 
isolated from a natural source. Use of the term "isolated" indicates that HG52 protein 
35 has been removed from its normal cellular environment. Thus, an isolated HG52 
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protein may be in a cell-free solution or placed in a different cellular environment 
from that in which it occurs naturally. The term isolated does not imply that an 
isolated HG52 protein is the only protein present, but instead means that an isolated 
HG52 protein is at least 95% free of non-amino acid material (e.g., nucleic acids, 
lipids, carbohydrates) naturally associated with the HG52 protein. Thus, an HG52 
protein that is expressed in bacteria or even in eukaryotic cells which do not naturally 
(i.e., without human intervention) express it through recombinant means is an 
"isolated HG52 protein." 

One aspect of this invention is the identification and cloning of a novel 
G protein-coupled receptor (GPCR), substantially free from other proteins, designated 
as HG52. 

Another aspect of this invention are nucleic acids which encode the 
HG52 G protein-coupled receptor. These nucleic acids are substantially free from 
other nucleic acids. 

The present invention provides a cDNA molecule substantially free 
from other nucleic acids having the nucleotide sequence shown in Figure 1 as 
SEQ.ID.NO.:l. SEQ.ID.NO.rl contains an open reading frame (positions 23-1,099 of 
SEQ.ID.NO.:l) encoding a protein of 359 amino acids (see Figure 3A-B). Thus, the 
present invention also provides a DNA molecule substantially free from other nucleic 
acids comprising the nucleotide sequence of positions 23-1,099 of SEQ.ID.NO.:l. 
The present invention also provides recombinant DNA molecules comprising the 
nucleotide sequence of positions 23-1,099 of SEQ.ID.NO.:l. 

Based on its predicted amino acid sequence, the HG52 protein most 
likely represents a novel G-protein coupled receptor (GPCR) of the rhodopsin family 
since the HG52 protein contains many of the characteristic features of GPCRs of the 
rhodopsin family, viz. 

(a) seven transmembrane domains; 

(b) homology with members of the rhodopsin family of GPCRs; 

(c) signature motifs of GPCRs in the rhodopsin family. 
Northern blot analysis (Figure 4) showed that HG52 RNA is widely 

expressed in humans as a transcript of about 4.5 kb, especially in cells of the immune 
system (peripheral blood lymphocytes (PBLs), spleen, bone marrow, lymph nodes). 
This argues for a role for HG52 in immune system function. 

The novel DNA sequences of the present invention encoding HG52, in 
whole or in part, can be linked with other DNA sequences, i.e., DNA sequences to 
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which HG52 is not naturally linked, to form "recombinant DNA molecules" 
containing HG52. The novel DNA sequences of the present invention can be inserted 
into vectors in order to direct recombinant expression of HG52. Such vectors may be 
comprised of DNA or RNA; for most purposes DNA vectors are preferred. Typical 
5 vectors include plasmids, modified viruses, bacteriophage, cosmids, yeast artificial 
chromosomes, and other forms of episomal or integrated DNA that can encode HG52. 
One skilled in the art can readily determine an appropriate vector for a particular use. 

Included in the present invention are cDNA sequences that hybridize 
to SEQ.ID.NO.:l under stringent conditions. By way of example, and not limitation, 

10 a procedure using conditions of high stringency is as follows: Prehybridization of 

filters containing DNA is carried out for 2 hr. to overnight at 65°C in buffer composed 
of 6X SSC, 5X Denhardt's solution, and 100 j^g/ml denatured salmon sperm DNA. 
Filters are hybridized for 12 to 48 hrs at 65°C in prehybridization mixture containing 
100 ng/ml denatured salmon sperni DNA and 5-20 X 10^ cpm of 32p_j a beled probe. 

15 Washing of filters is done at 37°C for 1 hr in a solution containing 2X SSC, 0.1% 

SDS. This is followed by a wash in 0.1 X SSC, 0.1% SDS at 50°C for 45 min. before 
autoradiography. 

Other procedures using conditions of high stringency would include 
either a hybridization step carried out in 5XSSC, 5X Denhardt's solution, 50% 

20 formamide at 42°C for 12 to 48 hours or a washing step carried out in 0.2X SSPE, 
0.2% SDS at 65°C for 30 to 60 minutes. 

Reagents mentioned in the foregoing procedures for carrying out high 
stringency hybridization are well known in the art. Details of the composition of 
these reagents can be found in, e.g., Sambrook, Fritsch, and Maniatis, 1989, 

25 Molecular Cloning: A Laboratory Manual , second edition, Cold Spring Harbor 

Laboratory Press. In addition to the foregoing, other conditions of high stringency 
which may be used are well known in the art. 

The degeneracy of the genetic code is such that, for all but two amino 
acids, more than a single codon encodes a particular amino acid. This allows for the 

30 construction of synthetic DNA that encodes the HG52 protein where the nucleotide 
sequence of the synthetic DNA differs significantly from the nucleotide sequence of 
SEQ.ID.NO.:l, but still encodes the same HG52 protein as SEQ.ID.NO.:l. Such 
synthetic DNAs are intended to be within the scope of the present invention. If it is 
desired to express such synthetic DNAs in a particular host cell or organism, the 

35 codon usage of such synthetic DNAs can be adjusted to reflect the codon usage of 
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that particular host, thus leading to higher levels of expression of HG52 protein in the 
host. 

Another aspect of the present invention includes host cells that have 
been engineered to contain and/or express DNA sequences encoding HG52. Such 
5 recombinant host cells can be cultured under suitable conditions to produce HG52. 
An expression vector containing DNA encoding HG52 can be used for expression of 
HG52 in a recombinant host cell. Recombinant host cells may be prokaryotic or 
eukaryotic, including but not limited to, bacteria such as E. coli, fungal cells such as 
yeast, mammalian cells including, but not limited to, cell lines of human, bovine, 

10 porcine, monkey, and rodent origin, and insect cells including but not limited to, 
Drosophila and silkworm derived cell lines. Cell lines derived from mammalian 
species which are suitable for recombinant expression of HG52 and which are 
commercially available, include but are not limited to, L cells L-M(TK") (ATCC CCL 
1.3), L cells L-M (ATCC CCL 1.2), 293 (ATCC CRL 1573), Raji (ATCC CCL 86), 

15 CV-1 (ATCC CCL 70), COS-1 (ATCC CRL 1650), COS-7 (ATCC CRL 1651), 
CHO-K1 (ATCC CCL 61), 3T3 (ATCC CCL 92), NIH/3T3 (ATCC CRL 1658), 
HeLa (ATCC CCL 2), CI 271 (ATCC CRL 1616), BS-C-1 (ATCC CCL 26) and 
MRC-5 (ATCC CCL 171). 

Human embryonic kidney (HEK 293) cells and Chinese hamster ovary 

20 (CHO) cells are particularly suitable for expression of the HG52 protein because these 
cells express a large number of G-proteins. Thus, it is likely that at least one of these 
G-proteins will be able to functionally couple the signal generated by interaction of 
HG52 and its ligands, thus transmitting this signal to downstream effectors, 
eventually resulting in a measurable change in some assayable component, e.g., 

25 cAMP level, expression of a reporter gene, hydrolysis of inositol lipids, or 
intracellular Ca2+ levels. 

Other cells that are particularly suitable for expression of the HG52 
protein are immortalized melanophore pigment cells from Xenopus laevis. Such 
melanophore pigment cells can be used for functional assays using recombinant 

30 expression of HG52 in a manner similar to the use of such melanophore pigment cells 
for the functional assay of other recombinant GPCRs (Graminski et aL, 1993, J. Biol. 
Chem. 268:5957-5964; Lerner, 1994, Trends Neurosci. 17:142-146; Potenza & 
Lerner, 1992, Pigment Cell Res. 5:372-378). 

A variety of mammalian expression vectors can be used to express 

35 recombinant HG52 in mammalian and other cells. Commercially available 
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mammalian expression vectors which are suitable include, but are not limited to, 
pCR2.1 (Invitrogen), pMClneo (Stratagene), pSG5 (Stratagene), pcDNAI and 
pcDNAIamp, pcDNA3, pcDNA3.1, pCR3.1 (Invitrogen), EBO-pSV2-neo (ATCC 
37593), pBPV-l(8-2) (ATCC 371 10), pdBPV-MMTneo(342-12) (ATCC 37224), 
5 pRSVgpt (ATCC 37199), pRSVneo (ATCC 37198), and pSV2-dhfr (ATCC 37146). 
For expression in non-mammalian cells, various suitable expression vectors are 
known in the art. The choice of vector will depend upon cell type used, level of 
expression desired, and the like. Following expression in recombinant cells, HG52 
can be purified by conventional techniques to a level that is substantially free from 
10 other proteins. 

The present invention includes HG52 protein substantially free from 
other proteins. The amino acid sequence of the full-length HG52 protein is shown in 
Figure 2 as SEQ.ID.NO.:2. Thus, the present invention includes HG52 protein 
substantially free from other proteins having the amino acid sequence of 

15 SEQ.ID.NO.:2. 

As with many receptor proteins, it is possible to modify many of the 
amino acids of HG52, particularly those which are not found in the ligand binding 
domain, and still retain substantially the same biological activity as the original 
receptor. Thus this invention includes modified HG52 polypeptides which have 

20 amino acid deletions, additions, or substitutions but that still retain substantially the 
same biological activity as HG52. It is generally accepted that single amino acid 
substitutions do not usually alter the biological activity of a protein (see, e.g., 
Molecular Biology of the Gene , Watson et aL 9 1987, Fourth Ed., The 
Benjamin/Cummings Publishing Co., Inc., page 226; and Cunningham & Wells, 

25 1 989, Science 244: 1 08 1 - 1 085). Accordingly, the present invention includes 

polypeptides where one amino acid substitution has been made in SEQ.ID.NO.:2 
wherein the polypeptides still retain substantially the same biological activity as 
HG52. The present invention also includes polypeptides where two or more amino 
acid substitutions have been made in SEQ.ID.NO.:2 wherein the polypeptides still 

30 retain substantially the same biological activity as HG52. In particular, the present 
invention includes embodiments where the above-described substitutions are 
conservative substitutions. In particular, the present invention includes embodiments 
where the above-described substitutions do not occur in the ligand-binding domain of 
HG52. 
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When deciding which amino acid residues of HG52 may be substituted 
to produce polypeptides that are functional equivalents of HG52, one skilled in the art 
would be guided by a comparison of the amino acid sequence of HG52 with the 
amino acid sequences of related proteins, e.g., the human thrombin receptors (see Vu 
5 et aL, 1991, Cell 64:1057-1068; Ishihara et aL, 1997, Nature 386:502-506; and Figure 
5 of the present application). Such a comparison would allow one skilled in the art to 
minimize the number of amino acid substitutions made in regions that are highly 
conserved between HG52 and the related protein. Accordingly, the present invention 
includes embodiments where the substitutions are conservative and do not occur in 
10 positions where HG52 and the human thrombin receptor 1 share the same amino acid 
(see Figure 5). 

One skilled in the art would also recognize that polypeptides that are 
functional equivalents of HG52 and have changes from the HG52 amino acid 
sequence that are small deletions or insertions of amino acids could also be produced 

15 by following the same guidelines, i.e., minimizing the differences in amino acid 
sequence between HG52 and related proteins. Small deletions or insertions are 
generally in the range of about 1 to 5 amino acids. The effect of such small deletions 
or insertions on the biological activity of the modified HG52 polypeptide can easily 
be assayed by producing the polypeptide synthetically or by making the required 

20 changes in DNA encoding HG52 and then expressing the DNA recombinantly and 
assaying the protein produced by such recombinant expression. 

The present invention also includes C-terminal truncated forms of 
HG52, particularly those which encompass the extracellular portion of the receptor, 
but lack the intracellular signaling portion of the receptor. Such truncated receptors 

25 are useful in various binding assays described herein, for crystallization studies, and 
for structure-activity-relationship studies. 

It has been found that, in some cases, membrane spanning regions of 
receptor proteins can be used to inhibit receptor function (Ng et aL, 1996, Biochem. 
Biophys. Res. Comm. 227:200-204; Hebert et aL, 1996, J. Biol. Chem. 271, 16384- 

30 16392; Lofts et aL, Oncogene 8:2813-2820). Accordingly, the present invention 
provides peptides derived from the seven membrane spanning regions of HG52 and 
their use to inhibit HG52 function. Such peptides can include whole or parts of the 
receptor membrane spanning domains. Among such peptides are peptides containing 
amino acid residues 31-49, 61-82, and 98-1 19 of SEQ.ID.NO.:2. 
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The present invention also includes chimeric HG52 proteins. Chimeric 
HG52 proteins consist of a contiguous polypeptide sequence of HG52 fused in frame 
to a polypeptide sequence of a non-HG52 protein. For example, the N-terminal 
domain and seven transmembrane spanning domains of HG52 fused at the C-terminus 
5 in frame to a G protein would be a chimeric HG52 protein. 

The present invention also includes HG52 proteins that are in the form 
of multimeric structures, e.g., dimers. Such multimers of other G-protein coupled 
receptors are known (Hebert et aL, 1996, J. Biol. Chem. 271, 16384-16392; Ng et al, 
1996, Biochem. Biophys. Res. Comm. 227, 200-204; Romano et al., 1996, J. Biol. 
10 Chem. 271, 28612-28616). 

The present invention also includes isolated forms of HG52 proteins. 

The present invention includes methods of identifying compounds that 
specifically bind to HG52 protein, as well as compounds identified by such methods. 
The specificity of binding of compounds having affinity for HG52 is shown by 
15 measuring the affinity of the compounds for recombinant cells expressing the cloned 
receptor or for membranes from those cells. Expression of the cloned receptor and 
screening for compounds that bind to HG52 or that inhibit the binding of a known, 
radiolabeled ligand of HG52 to these cells, or membranes prepared from these cells, 
provides an effective method for the rapid selection of compounds with high affinity 
20 for HG52. Such ligands need not necessarily be radiolabeled but can also be 

nonisotopic compounds that can be used to displace bound radiolabeled compounds 
or that can be used as activators in functional assays. Compounds identified by the 
above method are likely to be agonists or antagonists of HG52 and may be peptides, 
proteins, or non-proteinaceous organic molecules. 
25 Therefore, the present invention includes assays by which HG52 

agonists and antagonists may be identified. Methods for identifying agonists and 
antagonists of other receptors are well known in the art and can be adapted to identify 
agonists and antagonists of HG52. Accordingly, the present invention includes a 
method for determining whether a substance is a potential agonist or antagonist of 
30 HG52 that comprises: 

(a) transfecting cells with an expression vector encoding HG52; 

(b) allowing the transfected cells to grow for a time sufficient to 
allow HG52 to be expressed; 

(c) exposing the cells to a labeled ligand of HG52 in the presence 
35 and in the absence of the substance; 
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(d) measuring the binding of the labeled ligand to HG52; where if 
the amount of binding of the labeled ligand is less in the presence of the substance 
than in the absence of the substance, then the substance is a potential agonist or 
antagonist of HG52. 

5 The conditions under which step (c) of the method is practiced are 

conditions that are typically used in the art for the study of protein-ligand interactions: 
e.g., physiological pH; salt conditions such as those represented by such commonly 
used buffers as PBS or in tissue culture media; a temperature of about 4°C to about 
55°C. 

10 The present invention also includes a method for determining whether 

a substance is capable of binding to HG52, i.e., whether the substance is a potential 
agonist or an antagonist of HG52, where the method comprises: 

(a) providing test cells by transfecting cells with an expression 
vector that directs the expression of HG52 in the cells; 

1 5 (b) exposing the test cells to the substance; 

(c) measuring the amount of binding of the substance to HG52; 

(d) comparing the amount of binding of the substance to HG52 in 
the test cells with the amount of binding of the substance to control cells that have not 
been transfected with HG52; 

20 wherein if the amount of binding of the substance is greater in the test 

cells as compared to the control cells, the substance is capable of binding to HG52. 
Determining whether the substance is actually an agonist or antagonist can then be 
accomplished by the use of functional assays such as, e.g., the assay involving the use 
of promiscuous G-proteins described below. 

25 The conditions under which step (b) of the method is practiced are 

conditions that are typically used in the art for the study of protein-ligand interactions: 
e.g., physiological pH; salt conditions such as those represented by such commonly 
used buffers as PBS or in tissue culture media; a temperature of about 4°C to about 
55°C. 

30 In a particular embodiment of the above-described methods, the cells 

are eukaryotic cells. In another embodiment, the cells are mammalian cells. In other 
embodiments, the cells are L cells L-M(TK') (ATCC CCL 1.3), L cells L-M (ATCC 
CCL 1.2), 293 (ATCC CRL 1573), Raji (ATCC CCL 86), CV-1 (ATCC CCL 70), 
COS-1 (ATCC CRL 1650), COS-7 (ATCC CRL 1651), CHO-K1 (ATCC CCL 61), 
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3T3 (ATCC CCL 92), NIH/3T3 (ATCC CRL 1658), HeLa (ATCC CCL 2), C127I 
(ATCC CRL 1616), BS-C-1 (ATCC CCL 26) or MRC-5 (ATCC CCL 171). 

The assays described above can be carried out with cells that have 
been transiently or stably transfected with HG52. Transfection is meant to include 
5 any method known in the art for introducing HG52 into the test cells. For example, 
transfection includes calcium phosphate or calcium chloride mediated transfection, 
lipofection, infection with a retroviral construct containing HG52, and 
electroporation* 

Where binding of the substance or agonist to HG52 is measured, such 
10 binding can be measured by employing a labeled substance or agonist. The substance 

or agonist can be labeled in any convenient manner known to the art, e.g., 

radioactively, fluorescently, enzymatically. 

In particular embodiments of the above-described methods, HG52 has 

an amino acid sequence of SEQ.ID.NO.:2. 
15 The above-described methods can be modified in that, rather than 

exposing the test cells to the substance, membranes can be prepared from the test cells 

and those membranes can be exposed to the substance. Such a modification utilizing 

membranes rather than cells is well known in the art and is described in, e.g., Hess et 

aL 9 1992, Biochem. Biophys. Res. Comm. 184:260-268. 
20 Accordingly, the present invention provides a method for determining 

whether a substance is capable of binding to HG52 comprising: 

(a) providing test cells by transfecting cells with an expression 
vector that directs the expression of HG52 in the cells; 

(b) preparing membranes containing HG52 from the test cells and 
25 exposing the membranes to a ligand of HG52 under conditions such that the ligand 

binds to the HG52 in the membranes; 

(c) subsequently or concurrently to step (b), exposing the 
membranes from the test cells to a substance; 

(d) measuring the amount of binding of the ligand to the HG52 in 
30 the membranes in the presence and the absence of the substance; 

(e) comparing the amount of binding of the ligand to HG52 in the 
membranes in the presence and the absence of the substance where a decrease in the 
amount of binding of the ligand to HG52 in the membranes in the presence of the 
substance indicates that the substance is capable of binding to HG52; 

35 where HG52 has an amino acid sequence of SEQ.ID.NO.:2. 
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The present invention provides a method for determining whether a 
substance is capable of binding to HG52 comprising: 

(a) providing test cells by transfecting cells with an expression 
vector that directs the expression of HG52 in the cells; 
5 (b) preparing membranes containing HG52 from the test cells and 

exposing the membranes from the test cells to the substance; 

(c) measuring the amount of binding of the substance to the HG52 
in the membranes from the test cells; 

(d) comparing the amount of binding of the substance to HG52 in 
10 the membranes from the test cells with the amount of binding of the substance to 

membranes from control cells that have not been transfected with HG52; 

where HG52 has an amino acid sequence of SEQ.ID.NO.:2; 

where if the amount of binding of the substance to HG52 in the 
membranes from the test cells is greater than the amount of binding of the substance 
15 to the membranes from the control cells, then the substance is capable of binding to 
HG52. 

As a further modification of the above-described methods, RNA 
encoding HG52 can be prepared as, e.g., by in vitro transcription using a plasmid 
containing HG52 under the control of a bacteriophage T7 promoter, and the RNA can 

20 be microinjected into Xenopus oocytes in order to cause the expression of HG52 in 
the oocytes. Substances are then tested for binding to the HG52 expressed in the 
oocytes. Alternatively, rather than detecting binding, the effect of the substances on 
the electrophysiological properties of the oocytes can be determined. 

The present invention includes assays by which HG52 agonists and 

25 antagonists may be identified by their ability to stimulate or antagonize a functional 
response mediated by HG52. HG52 belongs to the class of proteins known as G- 
protein coupled receptors (GPCRs). GPCRs transmit signals across cell membranes 
upon the binding of ligand. The ligand-bound GPCR interacts with a heterotrimeric 
G-protein, causing the Got subunit of the G-protein to disassociate from the GP and 

30 Gy subunits. The Got subunit can then go on to activate a variety of second 
messenger systems. 

Generally, a particular GPCR is only coupled to a particular type of G- 
protein. Thus, to observe a functional response from the GPCR, it is necessary to 
ensure that the proper G-protein is present in the system containing the GPCR. It has 

35 been found, however, that there are certain G-proteins that are "promiscuous. " These 
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promiscuous G-proteins will couple to, and thus transduce a functional signal from, 
virtually any GPCR. See Offermanns & Simon, 1995, J. Biol. Chem. 270:15175- 
1 5 1 80 (Offermanns). Offermanns described a system in which cells are transfected 
with expression vectors that result in the expression of one of a large number of 
5 GPCRs as well as the expression of one of the promiscuous G-proteins Gal 5 or 
Gal 6. Upon the addition of an agonist of the GPCR to the transfected cells, the 
GPCR was activated and was able, via Gal 5 or Gal 6, to activate the P isoform of 
phospholipase C, leading to an increase in inositol phosphate levels in the cells. 

Therefore, by making use of these promiscuous G-proteins as in 

10 Offermanns, it is possible to set up functional assays for HG52, even in the absence of 
knowledge of the G-protein with which HG52 is coupled in vivo. One possibility is to 
create a fusion or chimeric protein composed of the extracellular and membrane 
spanning portion of HG52 fused to a promiscuous G-protein. Such a fusion protein 
would be expected to transduce a signal following binding of ligand to the HG52 

15 portion of the fusion protein. Accordingly, the present invention provides a method 
of identifying antagonists of HG52 comprising: 

(a) providing cells that expresses a chimeric HG52 protein fused at 
its C-terminus to a promiscuous G-protein; 

(b) exposing the cells to an agonist of HG52; 

20 (c) subsequently or concurrently to step (b), exposing the cells to a 

substance that is a suspected antagonist of HG52; 

(d) measuring the level of inositol phosphates in the cells; 
where a decrease in the level of inositol phosphates in the cells in the 
presence of the substance as compared to the level of inositol phosphates in the cells 
25 in the absence of the substance indicates that the substance is an antagonist of HG52. 

Another possibility for utilizing promiscuous G-proteins in connection 
with HG52 includes a method of identifying agonists of HG52 comprising: 

(a) providing cells that expresses both HG52 and a promiscuous 

G-protein; 

30 (b) exposing the cells to a substance that is a suspected agonist of 

HG52; 

(c) measuring the level of inositol phosphates in the cells; 
where an increase in the level of inositol phosphates in the cells as 

compared to the level of inositol phosphates in the cells in the absence of the 
35 suspected agonist indicates that the substance is an agonist of HG52. 
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Levels of inositol phosphates can be measured by monitoring calcium 
mobilization. Intracellular calcium mobilization is typically assayed in whole cells 
under a microscope using fluorescent dyes or in cell suspensions via luminescence 
using the aequorin assay. 
5 In a particular embodiment of the above-described methods, the cells 

are eukaryotic cells. In another embodiment, the cells are mammalian cells. In other 
embodiments, the cells are L cells L-M(TK~) (ATCC CCL 1 .3), L cells L-M (ATCC 
CCL 1.2), 293 (ATCC CRL 1573), Raji (ATCC CCL 86), CV-1 (ATCC CCL 70), 
COS-1 (ATCC CRL 1650), COS-7 (ATCC CRL 1651), CHO-K1 (ATCC CCL 61), 

10 3T3 (ATCC CCL 92), NIH/3T3 (ATCC CRL 1658), HeLa (ATCC CCL 2), C127I 
(ATCC CRL 1616), BS-C-1 (ATCC CCL 26) or MRC-5 (ATCC CCL 171). 

In a particular embodiment of the above-described method, the cells 
are transfected with expression vectors that direct the expression of HG52 and the 
promiscuous G-protein in the cells. 

15 The conditions under which step (b) of the method is practiced are 

conditions that are typically used in the art for the study of protein-ligand interactions: 
e.g., physiological pH; salt conditions such as those represented by such commonly 
used buffers as PBS or in tissue culture media; a temperature of about 4°C to about 
55°C. 

20 In a particular embodiment of the above-described method, the 

promiscuous G-protein is selected from the group consisting of Gal 5 or Gal 6. 
Expression vectors containing Gal 5 or Gal 6 are known in the art. See, e.g., 
Offermanns; Buhl et al. 9 1993, FEBS Lett. 323:132-134; Arnatruda et aL, 1993, J. 
Biol. Chem. 268:10139-10144. 

25 The above-described assay can be easily modified to form a method to 

identify antagonists of HG52. Such a method is also part of the present invention and 
comprises: 

(a) providing cells that expresses both HG52 and a promiscuous 

G-protein; 

30 (b) exposing the cells to a substance that is an agonist of HG52; 

(c) subsequently or concurrently to step (b), exposing the cells to a 
substance that is a suspected antagonist of HG52; 

(d) measuring the level of inositol phosphates in the cells; 
where a decrease in the level of inositol phosphates in the cells in the 

35 presence of the suspected antagonist as compared to the level of inositol phosphates 



- 15 - 



WO 00/20438 



PCT/US99/22634 



in the cells in the absence of the suspected antagonist indicates that the substance is 
an antagonist of HG52. 

In a particular embodiment of the above-described method, the cells 
are eukaryotic cells. In another embodiment, the cells are mammalian cells. In other 
5 embodiments, the cells are L cells L-M(TK~) (ATCC CCL 1 .3), L cells L-M (ATCC 
CCL 1.2), 293 (ATCC CRL 1573), Raji (ATCC CCL 86), CV-1 (ATCC CCL 70), 
COS-1 (ATCC CRL 1650), COS-7 (ATCC CRL 1651), CHO-K1 (ATCC CCL 61), 
3T3 (ATCC CCL 92), NIH/3T3 (ATCC CRL 1658), HeLa (ATCC CCL 2), C127I 
(ATCC CRL 1616), BS-C-1 (ATCC CCL 26) and MRC-5 (ATCC CCL 171). 

10 The conditions under which steps (b) and (c) of the method are 

practiced are conditions that are typically used in the art for the study of protein- 
ligand interactions: e.g., physiological pH; salt conditions such as those represented 
by such commonly used buffers as PBS or in tissue culture media; a temperature of 
about 4°C to about 55°C. 

15 In a particular embodiment of the above-described method, the cells 

are transfected with expression vectors that direct the expression of HG52 and the 
promiscuous G-protein in the cells. 

In a particular embodiment of the above-described method, the 
promiscuous G-protein is selected from the group consisting of Gal 5 or Gal 6. 

20 In particular embodiments of the above-described methods, HG52 has 

an amino acid sequence of SEQ.ID.NO.:2. 

While the above-described methods are explicitly directed to testing 
whether "a" substance is an agonist or antagonist of HG52, it will be clear to one 
skilled in the art that such methods can be adapted to test collections of substances, 

25 e.g., combinatorial libraries, to determine whether any members of such collections 
are activators or inhibitors of HG52. Accordingly, the use of collections of 
substances, or individual members of such collections, as the substance in the above- 
described methods is within the scope of the present invention. 

Agonists and antagonists of HG52 that are identified by the above- 

30 described methods should have utility in the treatment of diseases that involve the 
inappropriate expression of HG52. In particular, given the expression pattern of 
HG52 (see Figure 4), such agonists and antagonists should have utility in the 
treatment of various immune system disorders or disorders involving inflammation, 
e.g., arthritis, asthma, lupus. Such agonists and antagonists should also be useful in 

35 treating various infectious diseases. 
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Given the resemblance between HG52 and the thrombin receptors, it is 
anticipated that agonists and antagonists of HG52 will have pharmacological activity 
and thus be useful in a similar manner to the manner in which agonists and 
antagonists of the thrombin receptors are useful, viz., as modulators of the effects of 
5 thrombin. Accordingly, it is believed that agonists and antagonists of HG52 will have 
utility in modulating: the conversion of fibrinogen to fibrin in plasma; the activation 
of clotting factors V, VIII, XIII, and protein C; the procoagulant function of platelets 
and endothelial cells; the stimulation of platelet activation; chemotaxis of monocytes 
and lymphocytes; mitogenesis of lymphocytes and mesenchymal cells such as 
1 0 vascular smooth muscle cells, fibroblasts, and epithelial cells. 

The present invention includes pharmaceutical compositions 
comprising agonists and antagonists of HG52. The agonists and antagonists are 
generally combined with pharmaceutically acceptable carriers to form pharmaceutical 
compositions. Examples of such carriers and methods of formulation of 
15 pharmaceutical compositions containing agonists and antagonists and carriers can be 
found in Remington's Pharmaceutical Sciences. To form a pharmaceutically 
acceptable composition suitable for effective administration, such compositions will 
contain a therapeutically effective amount of the agonists and antagonists. 

Therapeutic or prophylactic compositions are administered to an 
20 individual in amounts sufficient to treat or prevent conditions where HG52 activity is 
abnormal. The effective amount can vary according to a variety of factors such as the 
individual's condition, weight, sex and age. Other factors include the mode of 
administration. The appropriate amount can be determined by a skilled physician. 

Compositions can be used alone at appropriate dosages. Alternatively, 
25 co-administration or sequential administration of other agents can be desirable. 

The compositions can be administered in a wide variety of therapeutic 
dosage forms in conventional vehicles for administration. For example, the 
compositions can be administered in such oral dosage forms as tablets, capsules (each 
including timed release and sustained release formulations), pills, powders, granules, 
30 elixirs, tinctures, solutions, suspensions, syrups and emulsions, or by injection. 
Likewise, they can also be administered in intravenous (both bolus and infusion), 
intraperitoneal, subcutaneous, topical with or without occlusion, or intramuscular 
form, all using forms well known to those of ordinary skill in the pharmaceutical arts. 

Advantageously, compositions can be administered in a single daily 
35 dose, or the total daily dosage can be administered in divided doses of two, three or 
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four times daily. Furthermore, compositions can be administered in intranasal form 
via topical use of suitable intranasal vehicles, or via transdermal routes, using those 
forms of transdermal skin patches well known to those of ordinary skill in that art. To 
be administered in the form of a transdermal delivery system, the dosage 
5 administration will, of course, be continuous rather than intermittent throughout the 
dosage regimen. 

The dosage regimen utilizing the compositions is selected in 
accordance with a variety of factors including type, species, age, weight, sex and 
medical condition of the patient; the severity of the condition to be treated; the route 

10 of administration; the renal, hepatic and cardiovascular function of the patient; and 
the particular composition thereof employed. A physician of ordinary skill can 
readily determine and prescribe the effective amount of the composition required to 
prevent, counter or arrest the progress of the condition. Optimal precision in 
achieving concentrations of composition within the range that yields efficacy without 

15 toxicity requires a regimen based on the kinetics of the composition's availability to 
target sites. This involves a consideration of the distribution, equilibrium, and 
elimination of a composition. 

The present invention also includes methods of expressing HG52 in 
recombinant systems and then utilizing the recombinantly expressed HG52 for 

20 counter-screening. When screening compounds in order to identify potential 

pharmaceuticals that specifically interact with a target receptor, it is necessary to 
ensure that the compounds identified are as specific as possible for the target receptor. 
To do this, it is necessary to screen the compounds against as wide an array as 
possible of receptors that are similar to the target receptor. Thus, in order to find 

25 compounds that are potential pharmaceuticals that interact with receptor A, it is 

necessary not only to ensure that the compounds interact with receptor A (the "plus 
target") and produce the desired pharmacological effect through receptor A, it is also 
necessary to determine that the compounds do not interact with receptors B, C, D, etc. 
(the "minus targets"). In general, as part of a screening program, it is important to 

30 have as many minus targets as possible (see Hodgson, 1992, Bio/Technology 10:973- 
980, at 980). Therefore, HG52 proteins and DNA encoding HG52 proteins have 
utility in counter-screens. That is, they can be used as "minus targets" in counter- 
screens in connection with screening projects designed to identify compounds that 
specifically interact with other G-protein coupled receptors. 
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The DNA of the present invention, or hybridization probes based upon 
the DNA, can be used in chromosomal mapping studies in order to identify the 
chromosomal locations of the HG52 gene or of genes encoding proteins related to 
HG52. Such mapping studies can be carried out using well-known genetic and/or 
5 chromosomal mapping techniques such as, e.g., linkage analysis with respect to 

known chromosomal markers or in situ hybridization. See, e.g., Verma et al., 1988, 
Human Chromosomes: A Manual of Basic Techniques , Pergamon Press, New York, 
NY. After identifying the chromosomal location of the HG52 gene or genes encoding 
proteins related to HG52, this information can be compared with the locations of 

10 known disease-causing genes contained in genetic map data (such as the data found in 
the Genome Issue of Science ( 1 994, 265 : 1 98 1 -2 1 44). In this way, one can correlate 
the chromosomal location of the HG52 gene or of genes encoding proteins related to 
HG52 with the locations of known disease-causing genes and thus help to limit the 
region of DNA containing such disease-causing genes. This will simplify the process 

15 of cloning such disease-causing genes. Also, once linkage between the chromosomal 
location of the HG52 gene or of genes encoding proteins related to HG52 and the 
location of a known disease-causing gene is established, that linkage can be used 
diagnostically to identify restriction fragment length polymorphisms (RFLPs) in the 
vicinity of the HG52 gene or of genes encoding proteins related to HG52. Such 

20 RFLPs will be associated with the disease-causing gene and thus can be used to 
identify individuals carrying the disease-causing gene. 

For such chromosomal mapping studies as described herein, it may be 
advantageous to use, in addtion to the DNA of the present invention, the reverse 
complement of the DNA of the present invention or RNA corresponding to the DNA 

25 of the present invention . 

The present invention also includes antibodies to the HG52 protein. 
Such antibodies may be polyclonal antibodies or monoclonal antibodies and are 
useful in treating disorders of the immune system that involve the inappropriate 
expression or activity of the HG52 protein. The antibodies of the present invention 

30 are raised against the entire HG52 protein or against suitable antigenic fragments of 
the protein that are coupled to suitable carriers, e.g., serum albumin or keyhole limpet 
hernocyanin, by methods well known in the art. Methods of identifying suitable 
antigenic fragments of a protein are known in the art. See, e.g., Hopp & Woods, 
1981, Proc. Natl. Acad. Sci. USA 78:3824-3828; and Jameson & Wolf, 1988, 

35 CABIOS (Computer Applications in the Biosciences) 4:181-186. 
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For the production of polyclonal antibodies, HG52 protein or an 
antigenic fragment, coupled to a suitable carrier, is injected on a periodic basis into an 
appropriate non-human host animal such as, e.g., rabbits, sheep, goats, rats, mice. 
The animals are bled periodically and sera obtained are tested for the presence of 
5 antibodies to the injected antigen. The injections can be intramuscular, 

intraperitoneal, subcutaneous, and the like, and can be accompanied with adjuvant. 

For the production of monoclonal antibodies, HG52 protein or an 
antigenic fragment, coupled to a suitable carrier, is injected into an appropriate non- 
human host animal as above for the production of polyclonal antibodies. In the case 

10 of monoclonal antibodies, the animal is generally a mouse. The animal's spleen cells 
are then immortalized, often by fusion with a myeloma cell, as described in Kohler & 
Milstein, 1975, Nature 256:495-497. For a fuller description of the production of 
monoclonal antibodies, see Antibodies: A Laboratory Manual , Harlow & Lane, eds., 
Cold Spring Harbor Laboratory Press, 1988. 

15 Gene therapy may be used to introduce HG52 polypeptides into the 

cells of target organs. Nucleotides encoding HG52 polypeptides can be ligated into 
viral vectors which mediate transfer of the nucleotides by infection of recipient cells. 
Suitable viral vectors include retrovirus, adenovirus, adeno-associated virus, herpes 
virus, vaccinia virus, and polio virus based vectors. Alternatively, nucleotides 

20 encoding HG52 polypeptides can be transferred into cells for gene therapy by non- 
viral techniques including receptor-mediated targeted transfer using ligand-nucleotide 
conjugates, lipofection, membrane fusion, or direct microinjection. These procedures 
and variations thereof are suitable for ex vivo as well as in vivo gene therapy. Gene 
therapy with HG52 polypeptides will be particularly useful for the treatment of 

25 diseases where it is beneficial to elevate HG52 activity. 

A cDNA fragment encoding full-length HG20 can be isolated from an 
appropriate human cDNA library by using the polymerase chain reaction (PCR) 
employing suitable primer pairs. Such primer pairs can be selected based upon the 
cDNA sequence for HG20 shown in Figure 1 as SEQ.ID.NO.:l. Suitable primer pairs 

30 would be, e.g. : 

ACCCCTCCAGGATGCAGG (SEQ.ID.NO.:4) and 
ACTCAGAACACACTCTCC (SEQ.ID.NO.:5). 
The above primers are meant to be illustrative. One skilled in the art 
would recognize that a variety of other suitable primers can be designed. For 



-20- 



WO 00/20438 



PCT/US99/22634 



example, the primers HG52.NotI (SEQ.ID.NO.:6) and HG52.KpnI (SEQ.ID.NO.:7), 
see Example 1, could be used. 

PCR reactions can be carried out with a variety of thermostable 
enzymes including but not limited to AmpliTaq, AmpliTaq Gold, or Vent polymerase. 
5 For AmpliTaq, reactions can be carried out in 10 mM Tris-Cl, pH 8.3, 2.0 mM 

MgCl2, 200 |LiM for each dNTP, 50 mM KC1, 0.2 |iM for each primer, 10 ng of DNA 

template, 0.05 units/|il of AmpliTaq. The reactions are heated at 95°C for 3 minutes 
and then cycled 35 times using the cycling parameters of 95°C, 20 seconds, 62°C, 20 
seconds, 72°C, 3 minutes. In addition to these conditions, a variety of suitable PCR 

10 protocols can be found in PCR Primer, A Laboratory Manual , edited by C.W. 

Dieffenbach and G.S. Dveksler, 1995, Cold Spring Harbor Laboratory Press; or PCR 
Protocols: A Guide to Methods and Applications , Michael et aL, eds., 1990, 
Academic Press . 

A suitable cDNA library from which a clone encoding HG52 can be 

1 5 isolated would be a human cDNA library made from RNA from peripheral blood 
lymphocytes, spleen, lymph nodes, or bone marrow. Such libraries can be prepared 
by methods well-known in the art. Alternatively, several commercially available 
libraries would be suitable,. e.g., the human leukocyte 5'-stretch plus cDNA library 
and human spleen 5-stretch plus cDNA library of Clontech Laboratories, Inc. (Palo 

20 Alto, CA, USA) . The primary clones of such libraries can be subdivided into pools 
with each pool containing approximately 20,000 clones and each pool can be 
amplified separately. 

By this method, a cDNA fragment encoding an open reading frame of 
359 amino acids (SEQ.ID.NO.:2) can be obtained. This cDNA fragment can be 

25 cloned into a suitable cloning vector or expression vector. For example, the fragment 
can be cloned into the mammalian expression vector pcDNA3.1 (Invitrogen, San 
Diego, Ca). HG52 protein can then be produced by transferring an expression vector 
encoding HG52 or portions thereof into a suitable host cell and growing the host cell 
under appropriate conditions. HG52 protein can then be isolated by methods well 

30 known in the art. 

As an alternative to the above-described PCR method, a cDNA clone 
encoding HG52 can be isolated from a cDNA library using as a probe 
oligonucleotides specific for HG52 and methods well known in the art for screening 
cDNA libraries with oligonucleotide probes. Such methods are described in, e.g., 

35 Sambrook et al, 1989, Molecular Cloning: A Laboratory Manual; Cold Spring 
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Harbor Laboratory, Cold Spring Harbor, New York; Glover, D.M. (ed.), 1985, DNA 
Cloning: A Practical Approach , MRL Press, Ltd., Oxford, U.K., Vol. I, II. 
Oligonucleotides that are specific for HG52 and that can be used to screen cDNA 
libraries can be readily designed based upon the cDNA sequence of HG52 shown in 
5 Figure 1 as SEQ.ID.NO.:l and can be synthesized by methods well-known in the art. 

Genomic clones containing the HG52 gene can be obtained from 
commercially available human PAC or BAC libraries, e.g., from Research Genetics, 
Huntsville, AL. Alternatively, one may prepare genomic libraries, especially in PI 
artificial chromosome vectors, from which genomic clones containing the HG52 can 
10 be isolated, using probes based upon the HG52 nucleotide sequences disclosed herein. 
Methods of preparing such libraries are known in the art (Ioannou et #/.,1994, Nature 
Genet. 6:84-89). 

The following non-limiting examples are presented to better illustrate 

1 5 the invention. 

EXAMPLE 1 

Cloning and sequencing of HG52 

An EST (GenBank accession number aa804531) was identified as a 
20 putative G-protein coupled receptor. Primers HG52.F91 and HG52.R242 were 
designed from the EST sequence and used to screen a superpool plasmid library 
derived from tissues of fetal brain, prostate, placenta, and testis. RCCA was then 
performed to isolated the full-length coding sequence of HG52 from individual 
library pools. In the primary reaction, primer HG52.F91 was paired with either 
25 PBS.873F or PBS.543R and HG52.R310 was paired with either PBS.873F or 

PBS.543R. In the secondary reaction, primer HG52.F165 was paired with PBS.838F 
or PBS.578R and HG52.R242 was paired with PBS.838F or PBS.578R. PCR 
products were sequenced and assembled with the aa804531 EST as a guide. Primers 
HG52.506F and HG52.603R were designed from new sequence and used to identify 
30 new pools in the superpool and tissue specific plasmid libraries. RCCA was carried 
out on these new pools with the following primer combinations: HG52.506F with 
PBS.543R or PBS.873F and HG52.623R with PBS.543R or PBS.873F. PCR 
products were sequenced and assembled. Nested RCCA was then done to identify 
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more specific and longer PCR products. In the primary reaction, HG52.357F was 
paired with either PBS.543R or PBS.873F and HG52.623R was paired with 
PBS.543R or PBS.873F. In the secondary reaction, HG52.F506 was paired with 
either PBS.578R or PBS.838F and HG52.526R was paired with either PBS.578R or 
PBS.838F. PCR products were purified, sequenced, and assembled into a contig. In 
order to identify new 5' transcripts, a human leukocyte phage library was scanned 
with primers HG52.357F and HG52.623R. Nested RCCA was then done on the 
positive wells using the following primer combinations: In the primary reaction, 
HG52.357F was paired with either lambda gtlO 3' or lambda gtlO 5' and HG52.623R 
with lambda gtlO 3' or lambda gtlO 5'. In the secondary reaction, HG52.506F was 
paired with either lambda gtlO 3' or lambda gtlO 5' and HG52.526R was paired with 
lambda gtlO 3' or lambda gtlO 5'. PCR products were then purified and sequenced 
and assembled into the contig. From these PCR extensions and sequencing, an open 
reading frame of l,080bp encoding a polypeptide sequence of 359 amino acids was 
obtained. The full length cDNA encoding HG52 was then amplified using primers 
HG52.NotI and HG52.KpnI and subcloned into the expression vector pcDNA3.1(+). 
The sequences of the primers used to sequence and clone HG52 are listed below: 



PRIMERS 



HG52.NotI GCGGCCGCTCCCTGAACCTCTGGCACC (SEQ.ID.NO.:6) 

HG52.KpnI GCGGTACCATGCAGGTCCCGAAC (SEQ.ID.NO.:7) 

HG52.165F GCTACTTCTGCCGCTGCTTC (SEQ.ID.NO.:8) 

HG52.310R TGCTGACCGCCCCCACCA (SEQ.ID.NO.:9) 

HG52.506F ACCATTGCAACCGCCACCA (SEQ.ID.NO.:10) 

HG52.623R GGTACAGGACCCCCAGGAAGC (SEQ.ID.NO.:ll) 

HG52.357F GGTGGCGGCGGTCAGCAT (SEQ.ID.NO.:12) 

HG52.526R AAGCAGCACCCCGAATACCCA (SEQ.ID.NO.:13) 

HG52.F91 TCCCGGGCTCTGAGGCAC (SEQ.ID.NO.:14) 

HG52.R242 TGCAGCGTCGCGTTGTCC (SEQ.ID.NO.:15) 

PBS.543R GGGGATGTGCTGCAAGGCGA (SEQ.ID.NO.:16) 

PBS.578R CCAGGGTTTTCCCAGTCACGAC (SEQ.ID.NO.:17) 

PBS.873F CCCAGGCTTTACACTTTATGCTTCC (SEQ.ID.NO.:18) 

PBS.838F TTGTGTGG A ATTGTG AGC G G AT AAC (SEQ.ID.NO.:19) 
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EXAMPLE 2 

Tissue distribution of HG52 RNA transcripts 

The data shown in Figure 4 were obtained by the use of human multi- 
tissue Northern blots purchased from Clontech (Palo Alto, CA). Hybridizations were 
5 carried out as suggested by Clontech using the entire coding region of HG52 as probe. 

The present invention is not to be limited in scope by the specific 
embodiments described herein. Indeed, various modifications of the invention in 
addition to those described herein will become apparent to those skilled in the art 
10 from the foregoing description. Such modifications are intended to fall within the 
scope of the appended claims. 

Various publications are cited herein, the disclosures of which are 
incorporated by reference in their entireties. 
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